Spam Detection using Generalized Additive Neural Networks

نویسندگان

  • Jan V. du Toit
  • David A. de Waal
چکیده

During the last decade the number of spam messages sent has increased significantly. These undesired emails place a heavy burden on end users and email service providers. As a result, a tenacious struggle to outsmart each other exists between people who send spam and the spam filter providers. Constant innovation is therefore of vital importance to curb the rapid increase of spam. In this article a Generalized Additive Neural Network (GANN) is harnessed to detect spam. This relative new type of neural network has a number of strengths that makes it a suitable classifier of spam. An automated GANN construction algorithm is applied to a spam data set. This method can perform in-sample model selection or cross-validation and produces results that compare favourable to other classifiers found in the literature. Even though neural networks in general are regarded as black box techniques, results obtained by GANNs can be interpreted by graphical methods. Partial residual plots assist the spam researcher to gain insight into the models constructed. Finally, by performing variable selection the temporal evolution of spam can be tracked. This feature ensures that the models can adapt to the ever-changing tactics of people who send spam with greater ease. Index terms Generalized Additive Model, GAM, Generalized Additive Neural Network, GANN, Neural Network, Spam, Spam detection.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Effective Model for SMS Spam Detection Using Content-based Features and Averaged Neural Network

In recent years, there has been considerable interest among people to use short message service (SMS) as one of the essential and straightforward communications services on mobile devices. The increased popularity of this service also increased the number of mobile devices attacks such as SMS spam messages. SMS spam messages constitute a real problem to mobile subscribers; this worries telecomm...

متن کامل

Neural Network Model for Email-Spam Detection

Email spam is a word that we come across in our daily life. The word spam means junk mails. The unsolicited emails that are received by any person in his/her mailbox are called spam. These junk mails are usually sent in bulk for advertising and marketing some products. This work presents a neural network approach to intrusion detection. A Multi-Layer Perceptron using Back Propagation Algorithm ...

متن کامل

Intelligent Word-Based Spam Filter Detection Using Multi-Neural Networks

SPAM e-mails have a direct cost in terms of time, server storage space, network bandwidth consumptions and indirect costs to protect privacy and security breaches. Efforts have been done to create new filters techniques to block SPAM, however spammers have developed tactics to avoid these filters. A constant update to these techniques is required. This paper proposes a novel approach which is a...

متن کامل

Additive Manufacturing Defect Detection using Neural Networks

Currently defect detection in additive manufacturing is predominately done by traditional image processing, approximation, and statistical methods. Two important aspects of defect detection are edge detection and porosity detection. Both problems appear to be good candidates to apply machine learning. This project describes the implementation of neural networks as an alternative method for defe...

متن کامل

An Investigation into the Use of Generalized Additive Neural Networks in Credit Scoring

Logistic regression occupies a central position in the field of credit scoring as it is relatively well understood and an explicit formula can be derived on which credit decisions may be based. Although artificial neural networks may be more powerful than logistic regression, it is not widely used in credit scoring because it is a black box with respect to interpretation and the absence of reas...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010